Búsqueda | Portal Regional de la BVS

1.

Identification of 6-methyladenosine sites using novel feature encoding methods and ensemble models.

Alromema, Nashwan; Suleman, Muhammad Taseer; Malebary, Sharaf J; Ahmed, Amir; Ali Mohammed Al-Rami Al-Ghamdi, Bandar; Khan, Yaser Daanial.

Sci Rep ; 14(1): 8180, 2024 04 08.

Artículo en Inglés | MEDLINE | ID: mdl-38589431

RESUMEN

N6-methyladenosine (6 mA) is the most common internal modification in eukaryotic mRNA. Mass spectrometry and site-directed mutagenesis, two of the most common conventional approaches, have been shown to be laborious and challenging. In recent years, there has been a rising interest in analyzing RNA sequences to systematically investigate mutated locations. Using novel methods for feature development, the current work aimed to identify 6 mA locations in RNA sequences. Following the generation of these novel features, they were used to train an ensemble of models using methods such as stacking, boosting, and bagging. The trained ensemble models were assessed using an independent test set and k-fold cross validation. When compared to baseline predictors, the suggested model performed better and showed improved ratings across the board for key measures of accuracy.

Asunto(s)

Adenosina , ARN , ARN/genética , ARN Mensajero , Adenosina/genética , Proyectos de Investigación

2.

m1A-Ensem: accurate identification of 1-methyladenosine sites through ensemble models.

Suleman, Muhammad Taseer; Alturise, Fahad; Alkhalifah, Tamim; Khan, Yaser Daanial.

BioData Min ; 17(1): 4, 2024 Feb 15.

Artículo en Inglés | MEDLINE | ID: mdl-38360720

RESUMEN

BACKGROUND: 1-methyladenosine (m1A) is a variant of methyladenosine that holds a methyl substituent in the 1st position having a prominent role in RNA stability and human metabolites. OBJECTIVE: Traditional approaches, such as mass spectrometry and site-directed mutagenesis, proved to be time-consuming and complicated. METHODOLOGY: The present research focused on the identification of m1A sites within RNA sequences using novel feature development mechanisms. The obtained features were used to train the ensemble models, including blending, boosting, and bagging. Independent testing and k-fold cross validation were then performed on the trained ensemble models. RESULTS: The proposed model outperformed the preexisting predictors and revealed optimized scores based on major accuracy metrics. CONCLUSION: For research purpose, a user-friendly webserver of the proposed model can be accessed through https://taseersuleman-m1a-ensem1.streamlit.app/ .

3.

PseU-Pred: An ensemble model for accurate identification of pseudouridine sites.

Suleman, Muhammad Taseer; Khan, Yaser Daanial.

Anal Biochem ; 676: 115247, 2023 09 01.

Artículo en Inglés | MEDLINE | ID: mdl-37437648

RESUMEN

Pseudouridine (ψ) is reported to occur frequently in all types of RNA. This uridine modification has been shown to be essential for processes such as RNA stability and stress response. Also, it is linked to a few human diseases, such as prostate cancer, anemia, etc. A few laboratory techniques, such as Pseudo-seq and N3-CMC-enriched Pseudouridine sequencing (CeU-Seq) are used for detecting ψ sites. However, these are laborious and drawn-out methods. The convenience of sequencing data has enabled the development of computationally intelligent models for improving ψ site identification methods. The proposed work provides a prediction model for the identification of ψ sites through popular ensemble methods such as stacking, bagging, and boosting. Features were obtained through a novel feature extraction mechanism with the assimilation of statistical moments, which were used to train ensemble models. The cross-validation test and independent set test were used to evaluate the precision of the trained models. The proposed model outperformed the preexisting predictors and revealed 87% accuracy, 0.90 specificity, 0.85 sensitivity, and a 0.75 Matthews correlation coefficient. A web server has been built and is available publicly for the researchers at https://taseersuleman-y-test-pseu-pred-c2wmtj.streamlit.app/.

Asunto(s)

Seudouridina , ARN , Humanos , Seudouridina/metabolismo , Procesamiento Postranscripcional del ARN

4.

iDHU-Ensem: Identification of dihydrouridine sites through ensemble learning models.

Suleman, Muhammad Taseer; Alturise, Fahad; Alkhalifah, Tamim; Khan, Yaser Daanial.

Digit Health ; 9: 20552076231165963, 2023.

Artículo en Inglés | MEDLINE | ID: mdl-37009307

RESUMEN

Background: Dihydrouridine (D) is one of the most significant uridine modifications that have a prominent occurrence in eukaryotes. The folding and conformational flexibility of transfer RNA (tRNA) can be attained through this modification. Objective: The modification also triggers lung cancer in humans. The identification of D sites was carried out through conventional laboratory methods; however, those were costly and time-consuming. The readiness of RNA sequences helps in the identification of D sites through computationally intelligent models. However, the most challenging part is turning these biological sequences into distinct vectors. Methods: The current research proposed novel feature extraction mechanisms and the identification of D sites in tRNA sequences using ensemble models. The ensemble models were then subjected to evaluation using k-fold cross-validation and independent testing. Results: The results revealed that the stacking ensemble model outperformed all the ensemble models by revealing 0.98 accuracy, 0.98 specificity, 0.97 sensitivity, and 0.92 Matthews Correlation Coefficient. The proposed model, iDHU-Ensem, was also compared with pre-existing predictors using an independent test. The accuracy scores have shown that the proposed model in this research study performed better than the available predictors. Conclusion: The current research contributed towards the enhancement of D site identification capabilities through computationally intelligent methods. A web-based server, iDHU-Ensem, was also made available for the researchers at https://taseersuleman-idhu-ensem-idhu-ensem.streamlit.app/.

5.

DHU-Pred: accurate prediction of dihydrouridine sites using position and composition variant features on diverse classifiers.

Suleman, Muhammad Taseer; Alkhalifah, Tamim; Alturise, Fahad; Khan, Yaser Daanial.

PeerJ ; 10: e14104, 2022.

Artículo en Inglés | MEDLINE | ID: mdl-36320563

RESUMEN

Background: Dihydrouridine (D) is a modified transfer RNA post-transcriptional modification (PTM) that occurs abundantly in bacteria, eukaryotes, and archaea. The D modification assists in the stability and conformational flexibility of tRNA. The D modification is also responsible for pulmonary carcinogenesis in humans. Objective: For the detection of D sites, mass spectrometry and site-directed mutagenesis have been developed. However, both are labor-intensive and time-consuming methods. The availability of sequence data has provided the opportunity to build computational models for enhancing the identification of D sites. Based on the sequence data, the DHU-Pred model was proposed in this study to find possible D sites. Methodology: The model was built by employing comprehensive machine learning and feature extraction approaches. It was then validated using in-demand evaluation metrics and rigorous experimentation and testing approaches. Results: The DHU-Pred revealed an accuracy score of 96.9%, which was considerably higher compared to the existing D site predictors. Availability and Implementation: A user-friendly web server for the proposed model was also developed and is freely available for the researchers.

Asunto(s)

Biología Computacional , ARN de Transferencia , Humanos , Biología Computacional/métodos , Aprendizaje Automático , Eucariontes

6.

Automated Analysis of Sleep Study Parameters Using Signal Processing and Artificial Intelligence.

Sohaib, Muhammad; Ghaffar, Ayesha; Shin, Jungpil; Hasan, Md Junayed; Suleman, Muhammad Taseer.

Int J Environ Res Public Health ; 19(20)2022 10 14.

Artículo en Inglés | MEDLINE | ID: mdl-36293844

RESUMEN

An automated sleep stage categorization can readily face noise-contaminated EEG recordings, just as other signal processing applications. Therefore, the denoising of the contaminated signals is inevitable to ensure a reliable analysis of the EEG signals. In this research work, an empirical mode decomposition is used in combination with stacked autoencoders to conduct automatic sleep stage classification with reliable analytical performance. Due to the decomposition of the composite signal into several intrinsic mode functions, empirical mode decomposition offers an effective solution for denoising non-stationary signals such as EEG. Preliminary results showed that through these intrinsic modes, a signal with a high signal-to-noise ratio can be obtained, which can be used for further analysis with confidence. Therefore, later, when statistical features were extracted from the denoised signals and were classified using stacked autoencoders, improved results were obtained for Stage 1, Stage 2, Stage 3, Stage 4, and REM stage EEG signals using this combination.

Asunto(s)

Inteligencia Artificial , Electroencefalografía , Electroencefalografía/métodos , Procesamiento de Señales Asistido por Computador , Fases del Sueño , Sueño , Algoritmos

7.

m1A-pred: Prediction of Modified 1-methyladenosine Sites in RNA Sequences through Artificial Intelligence.

Suleman, Muhammad Taseer; Khan, Yaser Daanial.

Comb Chem High Throughput Screen ; 25(14): 2473-2484, 2022.

Artículo en Inglés | MEDLINE | ID: mdl-35718969

RESUMEN

BACKGROUND: The process of nucleotides modification or methyl groups addition to nucleotides is known as post-transcriptional modification (PTM). 1-methyladenosine (m1A) is a type of PTM formed by adding a methyl group to the nitrogen at the 1st position of the adenosine base. Many human disorders are associated with m1A, which is widely found in ribosomal RNA and transfer RNA. OBJECTIVE: The conventional methods such as mass spectrometry and site-directed mutagenesis proved to be laborious and burdensome. Systematic identification of modified sites from RNA sequences is gaining much attention nowadays. Consequently, an extreme gradient boost predictor, m1A-Pred, is developed in this study for the prediction of modified m1A sites. METHODS: The current study involves the extraction of position and composition-based properties within nucleotide sequences. The extraction of features helps in the development of the features vector. Statistical moments were endorsed for dimensionality reduction in the obtained features. RESULTS: Through a series of experiments using different computational models and evaluation methods, it was revealed that the proposed predictor, m1A-pred, proved to be the most robust and accurate model for the identification of modified sites. AVAILABILITY AND IMPLEMENTATION: To enhance the research on m1A sites, a friendly server was also developed, which was the final phase of this research.

Asunto(s)

Inteligencia Artificial , ARN , Humanos , ARN/genética , ARN/química , Secuencia de Bases , Nucleótidos/química

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA